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BOOTSTRAP FOR [/-STATISTICS: A NEW APPROACH 


OLIMJON SH. SHARIPOV, JOHANNES TEWES, MARTIN WENDLER 


Abstract. Bootstrap for nonlinear statistics like [/-statistics of dependent 
data has been studied by several authors. This is typically done by producing 
a bootstrap version of the sample and plugging it into the statistic. We suggest 
an alternative approach of getting a bootstrap version of [/-statistics, which 
can be described as a compromise between bootstrap and subsampling. We 
will show the consistency of the new method and compare its finite sample 
properties in a simulation study. 


1. Introduction 

In many statistical applications, the asymptotic limit distribution cannot be used 
to construct tests or confidence intervals, as the limit might be unknown or depen¬ 
dent on unknown parameters, which are hard to estimate. Bootstrap versions of 
the statistical estimator provide a nonparametric alternative, so establishing the 
consistency of such bootstrap procedure is important. In the case of independent 
and identically distributed (iid) random variables, the main idea of the bootstrap 
consists of replacing the original sample Xi ,..., A„ of observations with unknown 
marginal distribution function F(x) by a new iid sample AJ",..., A* with the mar¬ 
ginal distribution function Fn{x) which is the empirical distribution function con¬ 
structed from the original sample (see Efron 0 )- As a bootstrap version of a 
statistic r(Ai,..., A„) one can take the plug-in version T(A(‘,..., A*). The validity 
of bootstrap for the sample mean of iid observations was first established by Bickel 
and Freedman [5] and Singh m- It is well-known that in some cases the bootstrap 
method provides a better approximation to the distribution of the statistic than, 
for instance, normal approximation (see Hall HO]). Especially when sample size is 
relatively small, bootstrap has some preferences. 

In the case of dependent observations this idea of getting a new sample does 
not work since a new iid sample does not capture the dependence structure, see 
Singh [3T]. Therefore several so-called blockwise bootstrap methods of getting a 
new sample were introduced (for overview of blockwise bootstrap methods see Lahiri 
[12)1. In all these methods, we resample from blocks of I consecutive observations, so 
that inside the blocks the dependence structure will be kept. We will consider two of 
them: The circular block bootstrap method introduced by Politis and Romano [15] 
and nonoverlapping block bootstrap method by Carlstein |3| . In the case of circular 
block bootstrap method we extend the original sample to Ai,..., A„, Ai,..., Ai_i 
where I = l{n) is a block length. To get a new sample A*j,...,A*^; we choose 
randomly and independently I consecutive observations of the sample m = [j] times 
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with 


P = X,, = X,+,_i|Xi, • • • ,^n) = ^ 

for j = 1,..., n, fc = 0,..., m — 1. Here and in what follows we denote by P*,E*, Var* 
the conditional probability, conditional expectation and conditional variance re¬ 
spectively. For instance the bootstrap version of sample mean X„ will be 


^ ml 




In the case of nonoverlapping block bootstrap method we construct a new sample 
X 2 by choosing randomly and independently blocks of I consecutive 
observations of the sample Xi, as above m = [j] times with 

P*(X2%, + 1 = = X , i ) = 1 


for j = 1,m, A: = 0,m — 1. In this case the bootstrap version of the sample 
mean is 


ml 




2 = 1 

For weakly dependent observations strong consistency (almost surely convergence) 
of the X„ and X„ were proved in Shao and Yu [T9] and Peligrad [T8|. Gongalves 
and White [^ and Dehling, Sharipov and Wendler |5] extended these results to 
functionals of mixing processes. In this paper we will concentrate on bootstrap for 
t/-statistics of weakly dependent observations. Bootstrap for [/-statistics of inde¬ 
pendent observations were studied by Bickel and Freedman [5], Arcones and Gine 
[H, Dehling and Mikosch [1|, Leucht and Neumann [Tg. Recently, Dehling and 
Wendler |6|, Sharipov and Wendler |20| and Leucht and Neumann m established 
the consistency of bootstrap estimators for U -statistics of weakly dependent obser¬ 
vations. In the aforementioned papers the same idea of getting bootstrap versions 
of [/-statistics have been used, which is based on the plug-in principle. We will 
now introduce [/-statistics. Let (X„)„g]N be a stationary sequence of random vari¬ 
ables with a common distribution function F{x). For simplicity reasons we consider 
[/-statistics of degree two i.e.: 


Un{h) 


2 

n{n — 1) 




where h : R x R ^ R is the symmetric and measurable kernel. The kernel h{x,y) 
is called degenerate, if 


Eh{Xi,x) = 0 for all x £ R. 

If this does not hold we call the kernel h{x,y) and the corresponding [/-statistics 
nondegenerate. In what follows we will only consider [/-statistics with a nonde¬ 
generate kernel. In the case of iid observations Xi, ...,X„ Bickel and Freedman |2] 
used the bootstrap sample XJ",..., X* of conditional independent observations with 
common distribution function En{x), which is the empirical distribution function 
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constructed by original sample. In order to get bootstrap versions of {/-statistics 
they plugged the bootstrapped observations in {/-statistics i.e. 



In the case of dependent observations, the same idea was explored by Dehling, 
Wendler [6] using the circular block bootstrap method, the corresponding bootstrap 
version is 



while in the case of nonoverlapping block bootstrap method the corresponding 
bootstrap version of {/-statistics is 



Consistency of U*{h) (and U*f,{h),k = 1,2 ) can be proved using the Hoeffding 
decomposition (see [TT]1: 



where 


e-.= Eh (Xi,X2), 

hi{x) := Eh{x,X 2 ) — 0, 
h 2 ix,y) ■■= h{x,y) - hi{x) - hi{y) - 9. 


In order to prove consistency of U*{h) it is enough to show the convergence of 
bootstrapped distribution of the sample mean for the second summand of this 
decomposition and convergence to zero (in probability or almost surely) of the 
third summand. The goal of this paper is to suggest a new resampling method for 
{/-statistics. The main idea is the following: We suggest to draw with replacement 
from the {/-statistics calculated on subsamples instead of drawing with replacement 
from blocks of observations and then plug them in {/-statistics. We introduce 
subsets {!,..., n}^ by 


Bi(i) = {(a,6) \ i < a,b < i + I — \,a < h} { = 1 ,..., n, 

, 'fl 

^ 2(0 = {(a, 6) : {(i — 1) -I- 1 < a, 6 < {{, a < 6} i = 1, ..., m = [--] 


where I = l{n) —>• 00 us called block length. Furthermore, let {T’i(j), j > 1} and 
{ 22 (j)) j > 1} be two sequences of iid random variables with distributions: 


P{Ti{l)=i) 


1 


i = l,...,n. 


n 


P(T2(1)=z) 


1 


z = 1, m. 


m 
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A fixed realisations of the sample Xi, we denote by xi, We set 

2 ^ 

Xiii) = y] h{xa,Xb), i = 

{a,b)GB[ (i) 

2 ^ 

= TT^ITn 51 Hxa,Xb), i = 

n 

u]* = y]/(Ti(j) = i)xi{i) = xi{Ti{j)), 

m 

“f = i)x2{i) = X2{T2{j)), 

i=l 


where /(•) is the indicator function. So a;i(i) respectively X 2 (i) are the 17-statistics 
calculated from the i-th block and Uj* and Uj* are the results of the drawing with 
replacement from these {/-statistics. In the case of xi(i) we assume that Xk = Xk-n 
if fc > n. As the bootstrap versions of {/-statistics {/„(/i) we take the following: 


- m 

ui:{h) = -Y,u]\ 

m 

2=1 

- m 

u^*ih) = -j2 


2 * 
'3 ' 


2=1 


Note that 


^ m 

= E h{xa,Xb). 

*=1 (a,b)£B'^{i) 


In the next sections we will state our main result: the strong consistency of the 
bootstrap based on U^*{h), fc = 1, 2. The new approach reduces the computational 
burden of the Monte Carlo method usually used for the bootstrap. After calculating 
the values u^*, for every run of the Monte Carlo evaluation one need only 0(”) 
calculation steps, while for the plug-in method, in every run the {/-statistic has to 
be calculated again in O(n^) steps. As the linear parts of the plug-in bootstrap 
version and the new bootstrap version of a {/-statistic are the same, we expect a 
similar behaviour. 

We will give an upper bound for the mean square error (MSE) of the variances 
of U^*{h), fc = 1, 2, which suggest a choice of the block length of order 0{^/n). In 
section 3, we will present our simulation results. We consider the sample variance 
as a {/-statistic and we will compare the new bootstrap approach with plug-in 
bootstrap and subsampling. The proofs of the main results will be given in section 
4. 


2. Main results 

Let be a stationary sequence of random variables with values in a sep¬ 

arable linear space. We will assume that this sequence satisfies some form of short 
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range dependence. Namely we will consider strong mixing and absolute regularity 
conditions. Recall that strong mixing coefficients (a(/c))fcg]N and absolute regularity 
coefficients (/3(fc))feg]N are defined as 

a{n) = snp{\P{AB) - P{A)P{B)\ A e ,B & e N}, 

/3(n) = sup{i?(sup \P{B/P^) - P{B)\ : B e 7"“ kelM} 
where 7^ is the cr - field generated by ..., X;,. Now we can formulate our results. 


Theorem 2.1. Let (X„)^g]^ be a stationary sequence of absolutely regular random 
variables. Assume that the following conditions hold (for some 5 > 0) 

• JJ \h {xi,X 2 )\^~^^ dP (xi) dF {X 2 ) < M for some M < 00 , 

• E\h{Xi,Xi+k)\^"^^ <M for all k&m, 

• P {n) = O {n~P) for some p > 

• l{n) < Cn^ for some e £ (0,1). 

Then as n —>■ 00 for i = 1, 2 


sup 

aiGE, 


p* 


V^l{UC{h)-E* [UC])<x 


P [s/n ([/„ (h) -9) <x] 


0 . 


in probability. 



Var [VnUn {h)] 


0 , 


In the next theorem we consider strongly mixing random variables. This is a 
weaker assumption on the dependence, but in this case we assume that the kernel 
satisfies the following condition: We say that a kernel is 7-Lipschitz-continuous, if 
there exists a constant C > 0 such that 


E [\h (X, Y)-h (X', r)| / (|X - X'l < e)] < Ce 

for every e > 0, every pair X and Y with the common distribution Vxi,Xk for some 
fc G IN or Vxi X Vxi and X' and Y also with one of these common distributions. 
For the examples of kernels which satisfy above condition see [5] , [5D] . 


Theorem 2.2. Let (X„)j^gj.j be a stationary sequence of strongly mixing random 
variables. Assume that the following conditions hold 

• JJ \h{xi,X2)f~^^ dF {xi) dE {X 2 ) < M, 

• 7 |h (Xi, Xi+fc)|^+'^ < M/or aZZ fc G IN, 

• l{n) < Cn'^ for some e G (0,1), 

• h is V-Lipschitz-continuous, 

• E |Xip > 0 for some 7 > 0, 

• a{n) = 0{n~P) for some p > • 


Then the statements of Theorem 2.1 hold. 


In the next theorem we give bounds for the mean squared error to give a deeper 
insight into the properties of bootstrap versions of C/-statistics. 


Theorem 2.3. Let be a stationary sequence of absolutely regular random 

variables. Assume that the following conditions hold: 

• JJ jh {xi,X 2 )\^^^^ dE (ti) dF (X 2 ) < M for some <5 > 0, 

. 7|h(Xi,Xi+^) 


|4+3<5 < M for all m G IN, 
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3(6 + i5 ) 


for some 0 < 5 <6. 


• 13 {n) = O In /i' 


Then for k = 1,2 


E (var*(V^C/^*(/i)) - Var(v^t/„(/i)))"" = 0(y + 


Choosing a block length of order Ijyjn, we can achieve that the mean squared 
error oiYa. 1 *{\/rnlU^*{h)) is of order 0(l/-y/n). 


3. Simulation 

We consider the estimator cr^ for the variance cr^ = Var{Xi) which is a t7- 
statistic with the kernel h(x,y) = -{x — y)^: 



(Xn) ngK is a stationary, Gaussian, autoregressive process with Xn = aX„-i + Cn, 
where (£n)nGiN is a sequence of iid standard normal random variables and a G 
{0.2, 0.4,0.6}. We will compare three methods for constructing confidence intervals: 

• The circular plug-in bootstrap (already used in Dehling, Wendler [^). 

• The new bootstrap version Uf*, the drawing with replacement from U- 
statistics caculated on subsamples. 

• Subsampling: The estimator of the unkown distribution is given by the 
empirical distribution function of the [/-statistics caculated on subsamples, 
see Politis and Romano m- 

For each combination of construction method, AR-coefhcient, sample size (n = 
50,100, 200,400) and block length, we have simulated 10.000 samples and evaluated 
the empirical probability of the 95% confidence interval to cover the true parameter. 
The two bootstrap version are evaluated by the Monte Carlo method with 1000 
times drawing with replacement. The results are summarized in the table below. 

In general, we draw the following conclusions from the simulation study: All 
three methods lead to confidence intervals with a coverage probabilty lower than 
the nominal confidence level of 95%. The performance of the subsampling is worse 
than the performance of two bootstrap methods in all situations. If the dependence 
in the AR-process is weak (a = 0.2), both bootstrap methods lead to comparable 
results, while for stronger dependence {a = 0.4, 0.6), the plug-in bootstrap has a 
better performance. 

For the plug-in bootstrap, the block length can be choosen smaller than for 
the new bootstrap method or for subsampling, especially in the case of stronger 
dependence {a = 0.4,0.6). In Figur 1 below, we give a more detailed picture of the 
coverage probabilities for different block lengths in the case n = 200, a = 0.4. 


4. Proofs of the theorems 


4.1. Preliminary results. In this section we formulate some necessary results 
that will be used in the proofs of our theorems. 
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a 

n 1 

0.2 

0.4 

0.6 

50 3 

5 

7 

10 

0.876/0.895/0.827 

0.865/0.884/0.823 

0.862/0.871/0.814 

0.835/0.849/0.787 

0.863/0.791/0.693 

0.838/0.819/0.737 

0.823/0.818/0.753 

0.808/0.814/0.743 

0.794/0.600/0.569 

0.765/0.677/0.583 

0.762/0.718/0.614 

0.757/0.730/0.607 

100 3 

5 

7 

10 

0.908/0.922/0.823 

0.911/0.910/0.852 

0.909/0.909/0.849 

0.890/0.902/0.851 

0.894/0.817/0.730 

0.896/0.838/0.770 

0.874/0.857/0.793 

0.880/0.859/0.815 

0.852/0.600/0.507 

0.851/0.682/0.617 

0.853/0.737/0.676 

0.849/0.767/0.716 

200 5 

7 

10 

15 

0.926/0.930/0.873 

0.925/0.926/0.881 

0.924/0.920/0.889 

0.910/0.910/0.884 

0.908/0.859/0.813 

0.905/0.870/0.831 

0.911/0.885/0.857 

0.902/0.893/0.849 

0.883/0.684/0.620 

0.891/0.738/0.704 

0.887/0.805/0.761 

0.885/0.835/0.800 

400 7 

10 
15 
20 

0.927/0.932/0.896 

0.931/0.927/0.906 

0.923/0.929/0.901 

0.924/0.918/0.906 

0.927/0.887/0.855 

0.924/0.901/0.873 

0.923/0.904/0.882 

0.902/0.909/0.887 

0.920/0.748/0.711 

0.920/0.814/0.777 

0.902/0.847/0.828 

0.904/0.866/0.849 


Table 1. Simulated coverage propabilites for plug-in bootstrap/ 
new bootstrap/ subsampling, sample size n = 50,100,200,400, 
AR-coefficient a = 0.2, 0.4,0.6, block length / = 3, 5, 7,10,15, 20 



Figure 1 . Simulated coverage probabilities for plug-in bootstrap 
(grey), new bootstrap (black), subsampling (dashed) as a function 
of the block length I, sample size n = 200, AR-coefficient a = 0.4 

Lemma 4.1. (Dehling,Wendler [7]/ Let (X„)^g]]^ be a stationary sequenee and 
h{x,y) a degenerate kernel satisfying JJ \h{xi,X 2 )\^^~^^ dF (xi) dF {X 2 ) < M for 
some 5 > 0 and E\h{Xi,Xi+rn)\'^'^^^ < M for all m € IN. Let be t > 0 such that 
one of the following two conditions holds: 

(1) (Ar„)^gj]^ is absolutely regular and (^) “ 0{n'^). 
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(2) is strongly mixing, E\Xi\'^ < oo for a j > 0, h is V-Lipschitz- 

x~^Tl 2'y(5 

continuous with constant C > 0 and X]fc=o ka~i‘’+^+^r +2 (fc) = O (n'^). 


Then: 


\E[h2iX,„X,,)h2iX,^,X,J]\=0{n^+^), 

'll »^2 )*3:^4 = 1 


Lemma 4.2. (Yoshihara Let (X„)^g]pg be a stationary sequence of absolutely 
regular random variables. Assume that the the following conditions hold (for some 
S > 0) and 

JJ \h{xi,X2)\'^^^dF{xi)dE{x2)<M, 

E\h{Xi,Xi+k)f^^ < M for allkem 

/ 3(4 + (5') \ 

fj (n) = O \ n ■ 5 ' J for some 0 < 6 < 6. 

Then ^ 

E {n^U^(h 2 )) = 0(n-i-''), 77 = min 1l| 

Lemma 4.3. (Yokoyama |22j ) Let be a stationary strongly mixing se¬ 

quence of random variables with EXi = pL and ||Xi|j 2 _|_^ < 00 for some 0 < S < 00 . 
Suppose that 2 < s < 2 -\- 5 and 


E 4 -I / / ^^(2+<5-s)/(2+^) ^ 

{a{n)y ' < 00 


n—1 

Then there exists a constant C depending only on s, 5 and the mixing coefficients 
(a(n))^g]^ such that 


n 


s 


E 








4.2. Proof of Theorem 12.11 Let us introduce blocks of indices 


Bi{i) = {i,... ,i-\-1 - 1} , i = l,...,n, 

B 2 ii) = {{i - l)l + I, ■. ■ ,il} , i = 1,... ,m. 

Note that the blocks = l,...,n} and {B 2 {i),i = l,...,m} correspond to 

the circular and nonoverlapping blocking methods, respectively. In the circular 
blocking method instead of the sample Xi, ..., Xn we consider the completed sample 
Xi,Xn, Xi,Xi-i- A fixed realization of the sample Ai,...,A„ we denote by 
Xi, ..., Xn and set 

= y Xk,i = l,...,n, 

k^Bi (j) 

^2,i(*) = y Xk,i = l,:.,m, 

k^B2{i) 

n 

X]* = Y,m{j) = i)x,^^{i), 

m 

Af =^/(T2(j)=*)x2 .l(z), 
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- rn 

^n,m ^ 2-^ ’ 

- m 

= _L ^2* 

^n,m ^ ■ 

i=l 


First we will prove the statement of the theorem for i = 1 (circular bootstrap). 
Using Hoeffding decomposition we have 

1 


K* (h) - E* K*]) = - 5: (uf - E*Ul:{h)) 


m 

i=i 

1 r „ 2 2 

“ TO zj ^ + 7 zj hi{xk) + 




h2{Xa,Xb) 


i = l fe6Si(Ti(i)) ' ' (a,f))GB;(Ti(j)) 

2 

m 

i=l 


1=1 fcGBi(i) *=1 (a,f))GB((i) 


= -E 

m f ^ 


i=i 


I 


( y] 7.i(a;fc) - ^ y^/ti(cc^)) 

fc6Bi(TiO)) 

2 


_^ 2 x-Ev 2 _^ 

E ^2(Xa,^6)- y] h2{Xa,Xb)) 

a,6)GB((Ti(i)) ’■=1 (a.b)GB((r) 


^ m . n 


] = 1 fcGBi(TiO')) 

2 


r=l 


ml{l — 1) 

.7 = 1 


E E {h2{Xa,Xb) - E E ,Xb)) 


1 = 1 (a,&)GB;(Ti(i)) 


(a,&)GB((r) 


Using the latter 


2 r 1 "■ 

Vmi ^ L ^ 


t = l fcGBi(TiO)) 


Z=1 


\/Tni{l — 1) 

=In + Iln- 


m 1 ^ 

E[ E {h2{Xa,Xb) - E E h2{Xa,Xb)) 

1 = 1 (a,h)GB((Ti(j)) r=l !„ b^cR' ^■,■\ 


{a,b)&B[{r) 


In order to prove the first statement of Theorem |2.1[ we first show that Iln —>■ 0 as 
n —>■ oo. Note that 


4 n / 1 ^ 

VaElIn = E ( E (^2(a;a,a;b)--y] y] h2(xa,xb) 

»=1 ^ (a,b)eB[{i) i=l <n. b^enili-i 


(a,b)GB( (i) 


1 


n 

E(; 


*=1 ^ ' {a,b)&B[{i) 


1 

(^E 


t{VKi-i) 


E ^2( 


^05 




2 
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By Lemma |4.1| we have as n —>■ oo 

e( ^ 


Y, h2iXa,Xb)) ^0 


and consequently VaElIn —>■ 0 in probability as n —>■ oo. With the Chebyshev 
inequality, it follows that //„ —>■ 0 in conditional probability. 

It remains to show the convergence of By Theorem 3.2 of Lahiri [T^] we have 
as n —)■ oo 

2 


sup 


p 


{ln<x)-P[—YHX.)<:c) 


0 . 


in probability. Furthermore, we have Var (•ynl7„(/i2)) —>■ 0 by Lemma 4.1 and can 
conclude that 


sup 


p 


Y ^ - P{Vn{Un{h) - 9) 


-e)< X 


0 . 


Using Slutzky’s lemma and //„ —>■ 0 in probability, we arrive at at the first statement 
of the Theorem: 


sup 

£CGR 


P* 


^ {K* (h) - [U;*] )<x]-P[V^ ([/„ (h) -9)<x] 


0 . 


For the second statement, we make use of Theorem 3.2 of Lahiri m again, which 
also states that 


Var*/„ - Var( — ^ h(W) 


0 


This together with XaPlIn —>■ 0 in probability and Xai {y/nUn{h 2 )) —t 0 (see 
Lemma 4.1) leads to 


Var* 


(h)j - Var [v^l7„ (h)] 


0 


in probability as n —> oo. 

In the case of i = 2 (nonoverlapping bootstrap) analogously to previous one we 
have 


ih)-E.[U^*]) 

TTL TTL I iitt, 


'll 




k^Bi (i) 


2=1 


2 272 ^ m 

E E (^2(a:a,a:,)--^ ^ h2ixa,Xb)) = 


i=l {a,b)eB[{T2{j)) 

Theorem 3.2 of Lahiri [T^] implies as n —>■ oo 

2 


2=1 


{a,b)GB[ (i) 




sup 

xGR 




^ 0 


Var*4 - Var ( ^ ^ h{X,) 


-A 0 
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in probability as n —>■ oo. It remains to show that Var*//„ —>■ 0 in probability, the 
rest of the proof can be done along the lines of the case i = 1 (circular bootstrap). 
We have 


VarX = Y. I E --Y. Y. *2(a^a,ai6)) 


2 = 1 




2=1 




- m 

= -E( 




^ ^ h2{Xa,Xb)f - nn 11 E h2{Xa,Xb)f. 




*=1 ^ > (a.b)6S)p) 


By Lemma 4.1 we have as n —)■ oo 


E 








SO E (Var*/4) ^ 0. 


4.3. Proof of Theorem 2.2 Theorem 2.2 can be proved in the same way as 


Theorem 2.1 and therefore the proof is omitted. 


4.4. Proof of Theorem |2.3[ To shorten the proof, we concentrate on the case 
k = 1 (circular bootstrap). We split the mean squared error into three parts 


E (Var*(vWt/,J*(h)) - Var(v^C/„ 

- ml ^ n 2 

< 3i?(var*(^ ^ h{Xl,)) - Var(^ ^ h{Xi))) 

'' VmL ^ y/n ^ / 

2=1 2=1 

1 2 

+ 3i?(var*(^ Y hliK^)) - Var*(V;))IC/„^*(h))) 

1 ^ 2 

+ 3£;(Var(v^C/„(h))) - Var(— ^ /ii(W))) = 3/„ + 3//„ + 3///„ 

^ 2=1 

For the first summand, we know by Theorem 3.1 of Politis and White m that 


E = 


For the second summand, we make use of the equation = {a + b){a — b) 

and the Holder-inequality (first for the bootstrap expectation and then for the 
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unconditional expectation) to obtain 

^ ml 

i;fvar*(^ y- Var*(V^C/, 
V \/niI 


ih = 


k* 


( / . ml - ml 

(yjj E hi{Xl,)-E*— y h,iXl,)-V^lUt{h)+E*V^lU::*{h) 

\ y TTl y TTl 

. ml - ml \ \ ^ 

E E - E*V^lU^*{h))j\ 

/ - ml - ml 

< s (i?* (^ y /ii {Xi,)-E* ^ y /Ji (y,)-* (/!)+£;* V^c/, 

\ ^yml 

- ml - ml 

y y + Vmiu!:*{h) - EWmiui 

^yml yml ‘^T 

2=1 2=1 

r - m n 

^ V V ; 2=1 (a,6)GB[(2) 

. ml ^ m n \ 

var*(2^y/ii(yj+^=——yy/(Ti(j) = i) y /i2(^a,xy) 


< E 


;(var*( 


- 1 ) 


yy/(Ti(j) = *) y 


(a,b)G-B[(2) 


2 \ 2 


. ml 

i?(var*(2^y/ii(y,) + 
^ ^ 2=1 


Vto;(Z 1 ) i=l(a,b)GB;(TiO)) 


— ^ Bn- 


We will treat these two factors separately, starting with An- By the definition of 
the bootstrap procedure and the stationarity of the sequence, we get the following: 


An=[E 


;(Var*( 


1 


< EE* 


W7o_iVEE^(^i(^') = *) E h,{Xn,X, 

yml{l 1) (o.&)GB;(d 

yy/(TiO) = *) y h2{Xn,x, 



4\ 2 


'P(;_1)4„E( E h2iXa,Xb) 

^ ' 1=1 (a,b)G-B;(i) 


1 


tE 


E{l-\Y 

oshihara 

i?( y h2(w,x,))" = oy. 


( y h2{Xn,Xb))) =0{^-\ 

(a,b)GBI(d ^ 


as we know from Lemma 3 of Yoshihara [23j that 

, 4 


l<2<J'<fe 
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With the help of the inequality (a + b)^ < 2o^ + 2b^, we split the second summand 
into two parts: 


Br, 


ml 




i=l 


y/rnKl — 1) 


ml 


E h2{X,,X,)) 

j = l (a,b)GBi(TiO)) 

1 

2 \ 2 


2 \ 2 


< 16(^£;(Var*(^E*i(^M))) j + 4^™ = 16-B; + 

so it suffices to study We use again stationarity and the definition of the 
bootstrap: 

/' 1 2\ 2 y 1 ^ 2\ 2 

i?;= (^£;(var*(^E^i(^M))) ) = (^(Var*(^E^i(^M))) j 


2=1 


< ) =(^-E(:77 E 


j+l-1 


5 = 1 *=7 


3 

j+l-1 




The last inequality follows from Lemma 4.3 This shows that //„ = 0(j). In the 
same way one can show that ///„ is of the same order, which completes the proof. 
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